Overview
Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 3000 |
| Missing cells | 2 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 257.8 KiB |
| Average record size in memory | 88.0 B |
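The derived rows in this table can be cross-checked from the raw counts. The sketch below assumes the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the Dataset statistics table.
n_variables = 10
n_observations = 3000
missing_cells = 2
total_size_kib = 257.8

# Assumption: the percentage is computed over every cell in the dataset.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes, averaged over all observations.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"missing cells: {missing_pct:.4f}% of {total_cells} cells")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share is well below 0.1% and the average record size rounds to the 88.0 B reported.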
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 3 |
Alerts
| carat is highly overall correlated with price and 3 other fields | High correlation |
|---|---|
| price is highly overall correlated with carat and 3 other fields | High correlation |
| x is highly overall correlated with carat and 3 other fields | High correlation |
| y is highly overall correlated with carat and 3 other fields | High correlation |
| z is highly overall correlated with carat and 3 other fields | High correlation |
Reproduction
| Analysis started | 2025-01-09 18:43:09.152421 |
|---|---|
| Analysis finished | 2025-01-09 18:43:57.560699 |
| Duration | 48.41 seconds |
| Software version | ydata-profiling v4.12.1 |
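The reported duration follows directly from the two timestamps. A small check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2025-01-09 18:43:09.152421", FMT)
finished = datetime.strptime("2025-01-09 18:43:57.560699", FMT)

# Elapsed wall-clock time between the start and end of the analysis.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```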
Variables
carat
Real number (ℝ)
High correlation 
| Distinct | 193 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.79416333 |
| Minimum | 0.21 |
| Maximum | 3.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 46.9 KiB |
Quantile statistics
| Minimum | 0.21 |
|---|---|
| 5-th percentile | 0.3 |
| Q1 | 0.4 |
| median | 0.7 |
| Q3 | 1.05 |
| 95-th percentile | 1.71 |
| Maximum | 3.5 |
| Range | 3.29 |
| Interquartile range (IQR) | 0.65 |
Descriptive statistics
| Standard deviation | 0.4723126 |
|---|---|
| Coefficient of variation (CV) | 0.5947298 |
| Kurtosis | 1.0427297 |
| Mean | 0.79416333 |
| Median Absolute Deviation (MAD) | 0.32 |
| Skewness | 1.1054686 |
| Sum | 2382.49 |
| Variance | 0.22307919 |
| Monotonicity | Not monotonic |
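Several of the carat statistics are simple functions of one another, so the tables can be checked for internal consistency. The sketch below uses only the rounded figures printed in this report, so the results agree with the tables to roughly six significant digits rather than exactly.

```python
# Figures copied from the carat tables.
n = 3000
mean = 0.79416333
std = 0.4723126
q1, q3 = 0.4, 1.05
minimum, maximum = 0.21, 3.5

cv = std / mean                   # coefficient of variation
variance = std ** 2               # variance is the squared standard deviation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range
total = mean * n                  # sum of all values

print(f"CV       = {cv:.7f}")
print(f"Variance = {variance:.8f}")
print(f"IQR      = {iqr:.2f}")
print(f"Range    = {value_range:.2f}")
print(f"Sum      = {total:.2f}")
```

The same relations hold for the other numeric variables (depth, table, price, x, y, z).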
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 0.3 | 138 | 4.6% |
| 0.31 | 122 | 4.1% |
| 1.01 | 121 | 4.0% |
| 0.7 | 100 | 3.3% |
| 0.32 | 96 | 3.2% |
| 0.9 | 84 | 2.8% |
| 0.33 | 80 | 2.7% |
| 0.41 | 79 | 2.6% |
| 1 | 74 | 2.5% |
| 0.71 | 74 | 2.5% |
| Other values (183) | 2032 | 67.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.21 | 1 | < 0.1% |
| 0.23 | 23 | 0.8% |
| 0.24 | 8 | 0.3% |
| 0.25 | 13 | 0.4% |
| 0.26 | 11 | 0.4% |
| 0.27 | 9 | 0.3% |
| 0.28 | 12 | 0.4% |
| 0.29 | 7 | 0.2% |
| 0.3 | 138 | 4.6% |
| 0.31 | 122 | 4.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 3.5 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 2.77 | 1 | < 0.1% |
| 2.56 | 1 | < 0.1% |
| 2.54 | 1 | < 0.1% |
| 2.5 | 1 | < 0.1% |
| 2.46 | 1 | < 0.1% |
| 2.44 | 1 | < 0.1% |
| 2.42 | 1 | < 0.1% |
| 2.38 | 1 | < 0.1% |
cut
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 46.9 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.291 |
| Min length | 4 |
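The maximum, minimum, and mean lengths can be reproduced from the per-category counts listed under Common Values for this variable. A minimal sketch, assuming length means the number of characters in the label, spaces included:

```python
# Category counts copied from the Common Values table for cut.
counts = {"Ideal": 1208, "Premium": 765, "Very Good": 674, "Good": 268, "Fair": 85}

n = sum(counts.values())

# Each label contributes its character length once per occurrence.
total_chars = sum(len(value) * count for value, count in counts.items())

mean_length = total_chars / n
max_length = max(len(value) for value in counts)
min_length = min(len(value) for value in counts)

print(f"n = {n}, total characters = {total_chars}")
print(f"max = {max_length}, min = {min_length}, mean = {mean_length:.3f}")
```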
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Very Good |
|---|---|
| 2nd row | Premium |
| 3rd row | Premium |
| 4th row | Ideal |
| 5th row | Good |
Common Values
| Value | Count | Frequency (%) |
| Ideal | 1208 | 40.3% |
| Premium | 765 | 25.5% |
| Very Good | 674 | 22.5% |
| Good | 268 | 8.9% |
| Fair | 85 | 2.8% |
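The Frequency (%) column in these tables appears to be each count divided by the number of observations, shown to one decimal place, with non-zero shares that would round to 0.0% displayed as "< 0.1%". The sketch below illustrates that display rule; `format_frequency` is a hypothetical helper written for this example, not a ydata-profiling function.

```python
def format_frequency(count: int, n: int) -> str:
    """Format count/n as a one-decimal percentage, in the style of the tables here."""
    pct = 100 * count / n
    # Assumption: a non-zero share that rounds to 0.0 is shown as "< 0.1%".
    if count > 0 and round(pct, 1) == 0.0:
        return "< 0.1%"
    return f"{pct:.1f}%"


n_observations = 3000
for value, count in [("Good", 268), ("Fair", 85)]:
    print(value, count, format_frequency(count, n_observations))

# A value that occurs once in 3000 rows falls below the display threshold.
print(format_frequency(1, n_observations))
```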
Length
Histogram of lengths of the category
Common Values (Plot)
Words
| Value | Count | Frequency (%) |
| ideal | 1208 | 32.9% |
| good | 942 | 25.6% |
| premium | 765 | 20.8% |
| very | 674 | 18.3% |
| fair | 85 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2647 | 14.0% |
| d | 2150 | 11.4% |
| o | 1884 | 10.0% |
| m | 1530 | 8.1% |
| r | 1524 | 8.1% |
| a | 1293 | 6.9% |
| I | 1208 | 6.4% |
| l | 1208 | 6.4% |
| G | 942 | 5.0% |
| i | 850 | 4.5% |
| Other values (6) | 3637 | 19.3% |
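The character counts can be rebuilt from the category counts: each label contributes its characters once per occurrence. A minimal sketch using `collections.Counter`, assuming characters are counted case-sensitively and the space in "Very Good" counts as a character:

```python
from collections import Counter

# Category counts copied from the Common Values table for cut.
counts = {"Ideal": 1208, "Premium": 765, "Very Good": 674, "Good": 268, "Fair": 85}

char_counts = Counter()
for value, count in counts.items():
    for ch in value:
        # Every occurrence of the label adds one occurrence of each character.
        char_counts[ch] += count

total = sum(char_counts.values())
for ch, count in char_counts.most_common(5):
    print(repr(ch), count, f"{100 * count / total:.1f}%")
print("distinct characters:", len(char_counts), "total:", total)
```

The ten most frequent characters plus the six remaining ones account for all 18873 characters in the column.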
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 18873 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 18873 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 18873 | 100.0% |
color
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 46.9 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | E |
|---|---|
| 2nd row | G |
| 3rd row | F |
| 4th row | F |
| 5th row | H |
Common Values
| Value | Count | Frequency (%) |
| G | 627 | 20.9% |
| E | 571 | 19.0% |
| F | 519 | 17.3% |
| H | 486 | 16.2% |
| D | 352 | 11.7% |
| I | 275 | 9.2% |
| J | 169 | 5.6% |
| (Missing) | 1 | < 0.1% |
Length
Histogram of lengths of the category
Common Values (Plot)
Words
| Value | Count | Frequency (%) |
| g | 627 | 20.9% |
| e | 571 | 19.0% |
| f | 519 | 17.3% |
| h | 486 | 16.2% |
| d | 352 | 11.7% |
| i | 275 | 9.2% |
| j | 169 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 627 | 20.9% |
| E | 571 | 19.0% |
| F | 519 | 17.3% |
| H | 486 | 16.2% |
| D | 352 | 11.7% |
| I | 275 | 9.2% |
| J | 169 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2999 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2999 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2999 | 100.0% |
clarity
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 46.9 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.1203333 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SI2 |
|---|---|
| 2nd row | VS1 |
| 3rd row | SI1 |
| 4th row | VS2 |
| 5th row | SI2 |
Common Values
| Value | Count | Frequency (%) |
| SI1 | 759 | 25.3% |
| VS2 | 673 | 22.4% |
| SI2 | 477 | 15.9% |
| VS1 | 442 | 14.7% |
| VVS2 | 292 | 9.7% |
| VVS1 | 213 | 7.1% |
| IF | 104 | 3.5% |
| I1 | 40 | 1.3% |
Length
Histogram of lengths of the category
Common Values (Plot)
Words
| Value | Count | Frequency (%) |
| si1 | 759 | 25.3% |
| vs2 | 673 | 22.4% |
| si2 | 477 | 15.9% |
| vs1 | 442 | 14.7% |
| vvs2 | 292 | 9.7% |
| vvs1 | 213 | 7.1% |
| if | 104 | 3.5% |
| i1 | 40 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 2856 | 30.5% |
| V | 2125 | 22.7% |
| 1 | 1454 | 15.5% |
| 2 | 1442 | 15.4% |
| I | 1380 | 14.7% |
| F | 104 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9361 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9361 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9361 | 100.0% |
depth
Real number (ℝ)
| Distinct | 110 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 61.738033 |
| Minimum | 54.3 |
| Maximum | 78.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 46.9 KiB |
Quantile statistics
| Minimum | 54.3 |
|---|---|
| 5-th percentile | 59.2 |
| Q1 | 61.1 |
| median | 61.9 |
| Q3 | 62.5 |
| 95-th percentile | 63.7 |
| Maximum | 78.2 |
| Range | 23.9 |
| Interquartile range (IQR) | 1.4 |
Descriptive statistics
| Standard deviation | 1.4240316 |
|---|---|
| Coefficient of variation (CV) | 0.023065711 |
| Kurtosis | 8.0627226 |
| Mean | 61.738033 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | 0.33429195 |
| Sum | 185214.1 |
| Variance | 2.0278661 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 62 | 136 | 4.5% |
| 61.9 | 126 | 4.2% |
| 62.1 | 121 | 4.0% |
| 62.2 | 115 | 3.8% |
| 62.4 | 113 | 3.8% |
| 61.6 | 110 | 3.7% |
| 61.5 | 102 | 3.4% |
| 61.8 | 102 | 3.4% |
| 62.3 | 99 | 3.3% |
| 61.7 | 99 | 3.3% |
| Other values (100) | 1877 | 62.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 54.3 | 1 | < 0.1% |
| 55.3 | 1 | < 0.1% |
| 56.2 | 1 | < 0.1% |
| 56.7 | 1 | < 0.1% |
| 56.8 | 2 | 0.1% |
| 56.9 | 1 | < 0.1% |
| 57 | 3 | 0.1% |
| 57.1 | 1 | < 0.1% |
| 57.2 | 5 | 0.2% |
| 57.3 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 78.2 | 1 | < 0.1% |
| 69.7 | 1 | < 0.1% |
| 68.8 | 1 | < 0.1% |
| 68.3 | 1 | < 0.1% |
| 68.2 | 1 | < 0.1% |
| 67.9 | 1 | < 0.1% |
| 67.8 | 1 | < 0.1% |
| 67.7 | 1 | < 0.1% |
| 67.6 | 1 | < 0.1% |
| 66.8 | 1 | < 0.1% |
table
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57.435433 |
| Minimum | 52 |
| Maximum | 95 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 46.9 KiB |
Quantile statistics
| Minimum | 52 |
|---|---|
| 5-th percentile | 54 |
| Q1 | 56 |
| median | 57 |
| Q3 | 59 |
| 95-th percentile | 61 |
| Maximum | 95 |
| Range | 43 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.308361 |
|---|---|
| Coefficient of variation (CV) | 0.040190539 |
| Kurtosis | 23.532243 |
| Mean | 57.435433 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.0254125 |
| Sum | 172306.3 |
| Variance | 5.3285307 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 57 | 550 | 18.3% |
| 56 | 544 | 18.1% |
| 58 | 479 | 16.0% |
| 55 | 359 | 12.0% |
| 59 | 347 | 11.6% |
| 60 | 239 | 8.0% |
| 54 | 151 | 5.0% |
| 61 | 110 | 3.7% |
| 62 | 68 | 2.3% |
| 63 | 29 | 1.0% |
| Other values (42) | 124 | 4.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 52 | 4 | 0.1% |
| 53 | 25 | 0.8% |
| 53.5 | 2 | 0.1% |
| 53.6 | 1 | < 0.1% |
| 53.7 | 2 | 0.1% |
| 53.8 | 2 | 0.1% |
| 53.9 | 2 | 0.1% |
| 54 | 151 | 5.0% |
| 54.1 | 4 | 0.1% |
| 54.2 | 2 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 95 | 1 | < 0.1% |
| 66 | 9 | 0.3% |
| 65 | 12 | 0.4% |
| 64.3 | 1 | < 0.1% |
| 64 | 18 | 0.6% |
| 63 | 29 | 1.0% |
| 62 | 68 | 2.3% |
| 61.2 | 1 | < 0.1% |
| 61 | 110 | 3.7% |
| 60.7 | 1 | < 0.1% |
price
Real number (ℝ)
High correlation 
| Distinct | 2191 |
|---|---|
| Distinct (%) | 73.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3887.0053 |
| Minimum | 371 |
| Maximum | 18731 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 46.9 KiB |
Quantile statistics
| Minimum | 371 |
|---|---|
| 5-th percentile | 551.65 |
| Q1 | 955.5 |
| median | 2369.5 |
| Q3 | 5342.5 |
| 95-th percentile | 12757.8 |
| Maximum | 18731 |
| Range | 18360 |
| Interquartile range (IQR) | 4387 |
Descriptive statistics
| Standard deviation | 3944.0354 |
|---|---|
| Coefficient of variation (CV) | 1.014672 |
| Kurtosis | 2.3229782 |
| Mean | 3887.0053 |
| Median Absolute Deviation (MAD) | 1640.5 |
| Skewness | 1.6430976 |
| Sum | 11661016 |
| Variance | 15555416 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 605 | 11 | 0.4% |
| 765 | 11 | 0.4% |
| 687 | 9 | 0.3% |
| 802 | 9 | 0.3% |
| 789 | 8 | 0.3% |
| 544 | 8 | 0.3% |
| 625 | 8 | 0.3% |
| 645 | 8 | 0.3% |
| 828 | 7 | 0.2% |
| 1013 | 7 | 0.2% |
| Other values (2181) | 2914 | 97.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 371 | 1 | < 0.1% |
| 382 | 1 | < 0.1% |
| 386 | 1 | < 0.1% |
| 393 | 2 | 0.1% |
| 394 | 4 | 0.1% |
| 402 | 4 | 0.1% |
| 404 | 1 | < 0.1% |
| 408 | 4 | 0.1% |
| 412 | 1 | < 0.1% |
| 413 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 18731 | 1 | < 0.1% |
| 18717 | 1 | < 0.1% |
| 18604 | 1 | < 0.1% |
| 18525 | 1 | < 0.1% |
| 18522 | 1 | < 0.1% |
| 18515 | 1 | < 0.1% |
| 18493 | 1 | < 0.1% |
| 18475 | 1 | < 0.1% |
| 18430 | 1 | < 0.1% |
| 18374 | 1 | < 0.1% |
x
Real number (ℝ)
High correlation 
| Distinct | 449 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.7242948 |
| Minimum | 3.89 |
| Maximum | 9.65 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 46.9 KiB |
Quantile statistics
| Minimum | 3.89 |
|---|---|
| 5-th percentile | 4.29 |
| Q1 | 4.715 |
| median | 5.68 |
| Q3 | 6.54 |
| 95-th percentile | 7.65 |
| Maximum | 9.65 |
| Range | 5.76 |
| Interquartile range (IQR) | 1.825 |
Descriptive statistics
| Standard deviation | 1.1187719 |
|---|---|
| Coefficient of variation (CV) | 0.19544275 |
| Kurtosis | -0.68177371 |
| Mean | 5.7242948 |
| Median Absolute Deviation (MAD) | 0.92 |
| Skewness | 0.42679852 |
| Sum | 17167.16 |
| Variance | 1.2516506 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 4.32 | 30 | 1.0% |
| 4.44 | 25 | 0.8% |
| 4.33 | 25 | 0.8% |
| 4.28 | 24 | 0.8% |
| 4.41 | 23 | 0.8% |
| 5.7 | 23 | 0.8% |
| 4.36 | 22 | 0.7% |
| 4.37 | 21 | 0.7% |
| 4.34 | 21 | 0.7% |
| 4.35 | 21 | 0.7% |
| Other values (439) | 2764 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 3.89 | 1 | < 0.1% |
| 3.9 | 3 | 0.1% |
| 3.91 | 1 | < 0.1% |
| 3.92 | 3 | 0.1% |
| 3.93 | 2 | 0.1% |
| 3.94 | 4 | 0.1% |
| 3.95 | 3 | 0.1% |
| 3.96 | 4 | 0.1% |
| 3.97 | 4 | 0.1% |
| 3.98 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 9.65 | 1 | < 0.1% |
| 9.42 | 1 | < 0.1% |
| 8.93 | 1 | < 0.1% |
| 8.83 | 1 | < 0.1% |
| 8.82 | 1 | < 0.1% |
| 8.79 | 1 | < 0.1% |
| 8.76 | 1 | < 0.1% |
| 8.75 | 1 | < 0.1% |
| 8.62 | 1 | < 0.1% |
| 8.61 | 1 | < 0.1% |
y
Real number (ℝ)
High correlation 
| Distinct | 452 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.7271833 |
| Minimum | 3.86 |
| Maximum | 9.59 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 46.9 KiB |
Quantile statistics
| Minimum | 3.86 |
|---|---|
| 5-th percentile | 4.31 |
| Q1 | 4.72 |
| median | 5.68 |
| Q3 | 6.53 |
| 95-th percentile | 7.6305 |
| Maximum | 9.59 |
| Range | 5.73 |
| Interquartile range (IQR) | 1.81 |
Descriptive statistics
| Standard deviation | 1.1104327 |
|---|---|
| Coefficient of variation (CV) | 0.1938881 |
| Kurtosis | -0.6954507 |
| Mean | 5.7271833 |
| Median Absolute Deviation (MAD) | 0.91 |
| Skewness | 0.42128093 |
| Sum | 17181.55 |
| Variance | 1.2330608 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 4.35 | 29 | 1.0% |
| 4.38 | 26 | 0.9% |
| 4.31 | 26 | 0.9% |
| 4.33 | 26 | 0.9% |
| 4.4 | 26 | 0.9% |
| 4.32 | 24 | 0.8% |
| 4.46 | 23 | 0.8% |
| 4.41 | 23 | 0.8% |
| 6.38 | 22 | 0.7% |
| 4.39 | 21 | 0.7% |
| Other values (442) | 2754 | 91.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 3.86 | 1 | < 0.1% |
| 3.9 | 1 | < 0.1% |
| 3.93 | 2 | 0.1% |
| 3.94 | 1 | < 0.1% |
| 3.96 | 2 | 0.1% |
| 3.97 | 2 | 0.1% |
| 3.98 | 4 | 0.1% |
| 3.99 | 2 | 0.1% |
| 4 | 5 | 0.2% |
| 4.01 | 5 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 9.59 | 1 | < 0.1% |
| 9.26 | 1 | < 0.1% |
| 8.83 | 2 | 0.1% |
| 8.78 | 1 | < 0.1% |
| 8.76 | 1 | < 0.1% |
| 8.73 | 1 | < 0.1% |
| 8.69 | 1 | < 0.1% |
| 8.59 | 1 | < 0.1% |
| 8.58 | 1 | < 0.1% |
| 8.56 | 1 | < 0.1% |
z
Real number (ℝ)
High correlation 
| Distinct | 295 |
|---|---|
| Distinct (%) | 9.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5332267 |
| Minimum | 1.07 |
| Maximum | 6.03 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 46.9 KiB |
Quantile statistics
| Minimum | 1.07 |
|---|---|
| 5-th percentile | 2.66 |
| Q1 | 2.91 |
| median | 3.51 |
| Q3 | 4.04 |
| 95-th percentile | 4.73 |
| Maximum | 6.03 |
| Range | 4.96 |
| Interquartile range (IQR) | 1.13 |
Descriptive statistics
| Standard deviation | 0.6891764 |
|---|---|
| Coefficient of variation (CV) | 0.19505581 |
| Kurtosis | -0.70181487 |
| Mean | 3.5332267 |
| Median Absolute Deviation (MAD) | 0.57 |
| Skewness | 0.38923957 |
| Sum | 10599.68 |
| Variance | 0.47496411 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
Common values
| Value | Count | Frequency (%) |
| 2.69 | 43 | 1.4% |
| 2.7 | 41 | 1.4% |
| 2.68 | 41 | 1.4% |
| 2.71 | 38 | 1.3% |
| 2.74 | 37 | 1.2% |
| 2.73 | 36 | 1.2% |
| 4.05 | 35 | 1.2% |
| 2.66 | 35 | 1.2% |
| 2.72 | 34 | 1.1% |
| 2.75 | 34 | 1.1% |
| Other values (285) | 2626 | 87.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1.07 | 1 | < 0.1% |
| 2.29 | 1 | < 0.1% |
| 2.36 | 1 | < 0.1% |
| 2.37 | 1 | < 0.1% |
| 2.39 | 1 | < 0.1% |
| 2.4 | 2 | 0.1% |
| 2.42 | 4 | 0.1% |
| 2.43 | 7 | 0.2% |
| 2.44 | 5 | 0.2% |
| 2.45 | 3 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 6.03 | 1 | < 0.1% |
| 5.58 | 1 | < 0.1% |
| 5.56 | 1 | < 0.1% |
| 5.48 | 1 | < 0.1% |
| 5.41 | 1 | < 0.1% |
| 5.37 | 2 | 0.1% |
| 5.3 | 1 | < 0.1% |
| 5.29 | 1 | < 0.1% |
| 5.28 | 1 | < 0.1% |
| 5.27 | 1 | < 0.1% |
Interactions
Correlations
| | carat | clarity | color | cut | depth | price | table | x | y | z |
|---|---|---|---|---|---|---|---|---|---|---|
| carat | 1.000 | 0.171 | 0.132 | 0.117 | 0.020 | 0.965 | 0.195 | 0.997 | 0.996 | 0.995 |
| clarity | 0.171 | 1.000 | 0.073 | 0.114 | 0.108 | 0.223 | 0.074 | 0.183 | 0.183 | 0.192 |
| color | 0.132 | 0.073 | 1.000 | 0.036 | 0.000 | 0.237 | 0.031 | 0.143 | 0.135 | 0.140 |
| cut | 0.117 | 0.114 | 0.036 | 1.000 | 0.386 | 0.224 | 0.323 | 0.267 | 0.113 | 0.132 |
| depth | 0.020 | 0.108 | 0.000 | 0.386 | 1.000 | 0.004 | -0.231 | -0.032 | -0.035 | 0.089 |
| price | 0.965 | 0.223 | 0.237 | 0.224 | 0.004 | 1.000 | 0.169 | 0.965 | 0.965 | 0.960 |
| table | 0.195 | 0.074 | 0.031 | 0.323 | -0.231 | 0.169 | 1.000 | 0.200 | 0.195 | 0.163 |
| x | 0.997 | 0.183 | 0.143 | 0.267 | -0.032 | 0.965 | 0.200 | 1.000 | 0.998 | 0.989 |
| y | 0.996 | 0.183 | 0.135 | 0.113 | -0.035 | 0.965 | 0.195 | 0.998 | 1.000 | 0.988 |
| z | 0.995 | 0.192 | 0.140 | 0.132 | 0.089 | 0.960 | 0.163 | 0.989 | 0.988 | 1.000 |
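The five "High correlation" alerts can be reproduced from this matrix by listing, for each variable, the other variables whose coefficient exceeds a cut-off. The sketch below assumes a cut-off of 0.5 on the absolute value. That threshold is an assumption for illustration, and any value between roughly 0.39 and 0.96 gives the same result for this matrix. `correlated_fields` is a hypothetical helper.

```python
names = ["carat", "clarity", "color", "cut", "depth", "price", "table", "x", "y", "z"]

# Rows copied from the correlation table, in the same order as `names`.
matrix = [
    [1.000, 0.171, 0.132, 0.117, 0.020, 0.965, 0.195, 0.997, 0.996, 0.995],
    [0.171, 1.000, 0.073, 0.114, 0.108, 0.223, 0.074, 0.183, 0.183, 0.192],
    [0.132, 0.073, 1.000, 0.036, 0.000, 0.237, 0.031, 0.143, 0.135, 0.140],
    [0.117, 0.114, 0.036, 1.000, 0.386, 0.224, 0.323, 0.267, 0.113, 0.132],
    [0.020, 0.108, 0.000, 0.386, 1.000, 0.004, -0.231, -0.032, -0.035, 0.089],
    [0.965, 0.223, 0.237, 0.224, 0.004, 1.000, 0.169, 0.965, 0.965, 0.960],
    [0.195, 0.074, 0.031, 0.323, -0.231, 0.169, 1.000, 0.200, 0.195, 0.163],
    [0.997, 0.183, 0.143, 0.267, -0.032, 0.965, 0.200, 1.000, 0.998, 0.989],
    [0.996, 0.183, 0.135, 0.113, -0.035, 0.965, 0.195, 0.998, 1.000, 0.988],
    [0.995, 0.192, 0.140, 0.132, 0.089, 0.960, 0.163, 0.989, 0.988, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not read from the report's configuration


def correlated_fields(names, matrix, threshold):
    """Map each variable to the other variables with |correlation| >= threshold."""
    result = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if partners:
            result[name] = partners
    return result


alerts = correlated_fields(names, matrix, THRESHOLD)
for name, partners in alerts.items():
    print(f"{name}: {partners[0]} and {len(partners) - 1} other fields")
```

With this cut-off only carat, price, x, y, and z are flagged, each with four partners, which matches the alerts listed in the overview.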
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
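Nullity correlation can be illustrated by replacing each column with a 0/1 missing indicator and correlating the indicators. The sketch below uses plain Pearson correlation on a handful of made-up rows. The rows are hypothetical and not taken from this dataset, and `pearson` is a helper written for this example.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)


# Hypothetical rows for illustration only; None marks a missing cell.
rows = [
    {"color": "E", "x": 6.94},
    {"color": None, "x": None},
    {"color": "G", "x": 5.74},
    {"color": "F", "x": None},
    {"color": None, "x": 4.46},
]

# 1 where the value is missing, 0 where it is present.
color_missing = [int(row["color"] is None) for row in rows]
x_missing = [int(row["x"] is None) for row in rows]

nullity_corr = pearson(color_missing, x_missing)
print(f"nullity correlation(color, x) = {nullity_corr:.3f}")
```

A value near 1 means the two columns tend to be missing together, a value near -1 means one tends to be present when the other is missing, and a value near 0 means no clear relationship. A column with no missing values has a constant indicator, so the correlation is undefined for it and this helper would divide by zero.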
Sample
First rows
| | carat | cut | color | clarity | depth | table | price | x | y | z |
|---|---|---|---|---|---|---|---|---|---|---|
| 15110 | 1.25 | Very Good | E | SI2 | 60.8 | 55.0 | 6073 | 6.94 | 7.00 | 4.24 |
| 311 | 0.74 | Premium | G | VS1 | 62.9 | 60.0 | 2800 | 5.74 | 5.68 | 3.59 |
| 53261 | 0.74 | Premium | F | SI1 | 61.4 | 59.0 | 2648 | 5.81 | 5.82 | 3.57 |
| 34196 | 0.33 | Ideal | F | VS2 | 61.9 | 55.0 | 854 | 4.46 | 4.43 | 2.75 |
| 2094 | 0.90 | Good | H | SI2 | 60.4 | 61.0 | 3114 | 6.14 | 6.22 | 3.73 |
| 23300 | 1.52 | Ideal | H | VS2 | 61.8 | 55.1 | 11333 | 7.38 | 7.42 | 4.58 |
| 10632 | 1.01 | Very Good | E | SI1 | 63.2 | 59.0 | 4830 | 6.28 | 6.25 | 3.96 |
| 43754 | 0.61 | Premium | I | VS2 | 61.8 | 56.0 | 1438 | 5.44 | 5.41 | 3.35 |
| 25562 | 2.14 | Good | H | SI1 | 57.5 | 60.0 | 14395 | 8.57 | 8.48 | 4.90 |
| 5120 | 0.91 | Very Good | H | SI1 | 62.7 | 56.0 | 3762 | 6.14 | 6.18 | 3.86 |
Last rows
| | carat | cut | color | clarity | depth | table | price | x | y | z |
|---|---|---|---|---|---|---|---|---|---|---|
| 17363 | 1.20 | Ideal | H | SI1 | 61.1 | 56.0 | 6968 | 6.92 | 6.87 | 4.21 |
| 50922 | 0.70 | Ideal | F | SI1 | 61.6 | 56.0 | 2319 | 5.73 | 5.67 | 3.51 |
| 34134 | 0.30 | Ideal | F | VVS1 | 62.0 | 54.2 | 854 | 4.31 | 4.33 | 2.68 |
| 22399 | 2.77 | Premium | H | I1 | 62.6 | 62.0 | 10424 | 8.93 | 8.83 | 5.56 |
| 24239 | 1.90 | Ideal | H | VS2 | 61.7 | 55.0 | 12443 | 7.9 | 7.81 | 4.85 |
| 5541 | 0.71 | Ideal | D | VVS2 | 61.3 | 57.0 | 3856 | 5.7 | 5.81 | 3.53 |
| 13001 | 1.22 | Premium | I | VS2 | 60.1 | 58.0 | 5405 | 6.92 | 7.00 | 4.18 |
| 1992 | 0.91 | Premium | F | SI2 | 62.1 | 56.0 | 3096 | 6.26 | 6.21 | 3.87 |
| 52231 | 0.72 | Premium | H | VS2 | 61.4 | 57.0 | 2484 | 5.79 | 5.75 | 3.54 |
| 33992 | 0.42 | Ideal | G | VS2 | 62.4 | 54.0 | 847 | 4.78 | 4.83 | 3.00 |